Synthesis of listener vocalisations with imposed intonation contours
Abstract
Synthesis of listener vocalisations is an active research area aimed at improving emotionally coloured conversational speech synthesis. To communicate different intentions, a synthesiser should be capable of generating a broad range of vocalisations with different acoustic properties. However, the data collected for corpus-based methods is necessarily limited in acoustic variability. This paper describes our approach to increasing the acoustic variability of vocalisations in terms of intonation. After selecting the best candidate for a given target from among the available vocalisations, we use prosody modification techniques to impose a target intonation contour. In an experiment, we combine markedly distinct intonation contours with vocalisations differing in segmental form, using the prosody modification techniques MLSA vocoding, FD-PSOLA, and HNM. In a listening test, we evaluate the perceived naturalness of the resulting synthesised vocalisations and assess the effect of segmental form, intonation contour, and modification technique on perceived meaning.
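To make the idea of imposing a target intonation contour concrete, here is a minimal sketch of one preparatory step that pitch modification methods commonly need: stretching a target F0 contour to the length of the selected vocalisation and deriving per-frame pitch-scale factors. It is not the authors' implementation. The function names `resample_contour` and `pitch_scale_factors`, the frame-wise F0 representation, and the convention that unvoiced frames carry an F0 of 0 are all assumptions made for illustration. The resynthesis itself (MLSA vocoding, FD-PSOLA, or HNM) is not shown.

```python
from typing import List


def resample_contour(contour: List[float], n_frames: int) -> List[float]:
    """Linearly interpolate a contour so that it spans n_frames points."""
    if n_frames <= 0 or not contour:
        return []
    if len(contour) == 1 or n_frames == 1:
        return [contour[0]] * n_frames
    step = (len(contour) - 1) / (n_frames - 1)
    out = []
    for i in range(n_frames):
        pos = i * step
        lo = int(pos)
        hi = min(lo + 1, len(contour) - 1)
        frac = pos - lo
        out.append(contour[lo] * (1 - frac) + contour[hi] * frac)
    return out


def pitch_scale_factors(source_f0: List[float],
                        target_contour: List[float]) -> List[float]:
    """Return target/source F0 ratios per frame.

    Assumption: frames with source F0 == 0 are unvoiced and are left
    unmodified (factor 1.0).
    """
    target = resample_contour(target_contour, len(source_f0))
    return [t / s if s > 0 else 1.0 for s, t in zip(source_f0, target)]


# Hypothetical example: a flat 100 Hz vocalisation with one unvoiced frame,
# and a rising target contour from 100 Hz to 200 Hz.
source_f0 = [100.0, 100.0, 0.0, 100.0, 100.0]
target_contour = [100.0, 200.0]
factors = pitch_scale_factors(source_f0, target_contour)
```

A pitch modification method could then use these factors frame by frame, for example to shift pitch marks in a PSOLA-style approach or to replace the F0 track passed to a vocoder.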
Related resources
Expression of speaker's intentions through sentence-final particle/ intonation combinations in Japanese conversational speech synthesis
Aiming to provide the synthetic speech with the ability to express speaker’s intentions and subtle nuances, we investigated the relationship between the speaker’s intentions that the listener perceived and sentence-final particle/intonation combinations in Japanese conversational speech. First, we classified F0 contours of sentence-final syllables in actual speech and found various distinctive ...
Prosodic models and speech synthesis: towards the common ground
Prosodic models have been extensively applied in speech synthesis. However, the necessity of synthesizing prosody has as yet not resulted in a generally agreed upon approach to prosodic modeling. This statement holds for the assignment of segmental durations as well as for generating F0 curves, the acoustic correlate of intonation contours. This paper concentrates on the use and usability of in...
Epistemic and attitudinal meanings of rise and rise-plateau contours
This paper investigates the epistemic and attitudinal meanings of rise and rise-plateau contours in listing contexts. Previous accounts of list intonation have made claims about epistemic meanings for list intonation, though without experimental evidence. In our first study, a metalinguistic task, subjects perceived rise and rise-plateau contours in listing contexts as having epistemic but also...
Inventory of intonation contours for text-to-speech synthesis
This paper presents an intonation model which determines intonation contours over intonation phrases. The model is described by four elements: communicative type of an intonation phrase; number of accent groups in it; position of the nuclear accent group in it; and set of target intonation points. Individualization of the model is based on semiautomatic analysis of speaker database. The model w...
A new model of intonation for use with speech synthesis and recognition
This paper describes a synthesis from analysis scheme for producing natural sounding intonation for speech synthesis. The paper presents a new method of describing F0 contours in terms of three basic phonetic intonation elements. Details are given of an automatic system for labelling F0 contours, which could be used for speech recognition purposes. Current work on extracting a phonological desc...
Publication date: 2010